Learning Prior Feature and Attention Enhanced Image Inpainting

نویسندگان

چکیده

AbstractMany recent inpainting works have achieved impressive results by leveraging Deep Neural Networks (DNNs) to model various prior information for image restoration. Unfortunately, the performance of these methods is largely limited representation ability vanilla Convolutional (CNNs) backbones. On other hand, Vision Transformers (ViT) with self-supervised pre-training shown great potential many visual recognition and object detection tasks. A natural question whether task can be greatly benefited from ViT backbone? However, it nontrivial directly replace new backbones in networks, as an inverse problem fundamentally different To this end, paper incorporates based Masked AutoEncoder (MAE) into model, which enjoys richer informative priors enhance process. Moreover, we propose use attention MAE make learn more long-distance dependencies between masked unmasked regions. Sufficient ablations been discussed about models paper. Besides, experiments on both Places2 FFHQ demonstrate effectiveness our proposed model. Codes pre-trained are released https://github.com/ewrfcas/MAE-FAR.KeywordsImage inpaintingAttentionVision transformer

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

Generative Image Inpainting with Contextual Attention

Recent deep learning based approaches have shown promising results for the challenging task of inpainting large missing regions in an image. These methods can generate visually plausible image structures and textures, but often create distorted structures or blurry textures inconsistent with surrounding areas. This is mainly due to ineffectiveness of convolutional neural networks in explicitly ...

متن کامل

Study Of Image Inpainting Based On Learning

In this paper, we construct a actual system of image inpainting based on the image inpainting system model[1] which was proposed before, in order to repair more types, more broken images in the different field and restore them more efficiently ,because there is no one “universal” or more general algorithm can repair all types of images. The proposed system integrates the commonly used several t...

متن کامل

Shift-Net: Image Inpainting via Deep Feature Rearrangement

Deep convolutional networks (CNNs) have exhibited their potential in image inpainting for producing plausible results. However, in most existing methods, e.g., context encoder, the missing parts are predicted by propagating the surrounding convolutional features through a fully connected layer, which intends to produce semantically plausible but blurry result. In this paper, we introduce a spec...

متن کامل

Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention

263 Abstract—In the real world applications such as landmark search, copy protection, fake image detection, partial duplicate image retrieval is very important. In the internet era users regularly upload images which are partially duplicate images on the domains like facebook, instagram and whatsapp etc. The partial image is only part of whole image, and the various kind of transformation invol...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-19784-0_18